NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Transfer Learning in Deep Reinforcement Learning: A Survey

https://doi.org/10.1109/TPAMI.2023.3292075

Zhu, Zhuangdi; Lin, Kaixiang; Jain, Anil K; Zhou, Jiayu (November 2023, IEEE Transactions on Pattern Analysis and Machine Intelligence)

Full Text Available
Self-Adaptive Imitation Learning: Learning Tasks with Delayed Rewards from Sub-optimal Demonstrations

https://doi.org/10.1609/aaai.v36i8.20914

Zhu, Zhuangdi; Lin, Kaixiang; Dai, Bo; Zhou, Jiayu (June 2022, Proceedings of the AAAI Conference on Artificial Intelligence)

Reinforcement learning (RL) has demonstrated its superiority in solving sequential decision-making problems. However, heavy dependence on immediate reward feedback impedes the wide application of RL. On the other hand, imitation learning (IL) tackles RL without relying on environmental supervision by leveraging external demonstrations. In practice, however, collecting sufficient expert demonstrations can be prohibitively expensive, yet the quality of demonstrations typically limits the performance of the learning policy. To address a practical scenario, in this work, we propose Self-Adaptive Imitation Learning (SAIL), which, provided with a few demonstrations from a sub-optimal teacher, can perform well in RL tasks with extremely delayed rewards, where the only reward feedback is trajectory-wise ranking. SAIL bridges the advantages of IL and RL by interactively exploiting the demonstrations to catch up with the teacher and exploring the environment to yield demonstrations that surpass the teacher. Extensive empirical results show that not only does SAIL significantly improve the sample efficiency, but it also leads to higher asymptotic performance across different continuous control tasks, compared with the state-of-the-art.
more » « less
Full Text Available
RCA: A Deep Collaborative Autoencoder Approach for Anomaly Detection

https://doi.org/10.24963/ijcai.2021/208

Liu, Boyang; Wang, Ding; Lin, Kaixiang; Tan, Pang-Ning; Zhou, Jiayu (August 2021, Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence (IJCAI-21))
null (Ed.)
Unsupervised anomaly detection plays a crucial role in many critical applications. Driven by the success of deep learning, recent years have witnessed growing interests in applying deep neural networks (DNNs) to anomaly detection problems. A common approach is using autoencoders to learn a feature representation for the normal observations in the data. The reconstruction error of the autoencoder is then used as outlier scores to detect the anomalies. However, due to the high complexity brought upon by the over-parameterization of DNNs, the reconstruction error of the anomalies could also be small, which hampers the effectiveness of these methods. To alleviate this problem, we propose a robust framework using collaborative autoencoders to jointly identify normal observations from the data while learning its feature representation. We investigate the theoretical properties of the framework and empirically show its outstanding performance as compared to other DNN-based methods. Our experimental results also show the resiliency of the framework to missing values compared to other baseline methods.
more » « less
Full Text Available
Federated Learning’s Blessing: Fedavg Has Linear Speedup

Qu, Zhaonan; Lin, Kaixiang; Li, Zhaojian; Zhou, Jiayu (May 2021, ICLR 2021 - Workshop on Distributed and Private Machine Learning (DPML))
null (Ed.)
Full Text Available
Off-Policy Imitation Learning from Observations

Zhu, Zhuangdi; Lin, Kaixiang; Dai, Bo; Zhou, Jiayu (December 2020, the Thirty-fourth Annual Conference on Neural Information Processing Systems (NeurIPS 2020))
null (Ed.)
Full Text Available
Ranking Policy Gradient

Lin, Kaixiang; Zhou, Jiayu (April 2020, The Eighth International Conference on Learning Representations (ICLR 2020))
null (Ed.)
Full Text Available

Search for: All records